
Europaisches Patsntamt 
European Patent Office 
Office europeen des brevets 



illlillllilliHiiii 

@ Publication number : 0 495 622 A2 



® 



EUROPEAN PATENT APPLICATION 



@ Application number : 92300^2.4 
@ Date of faing : 14.01.92 



@ int ci.= : G06F .15/401, G06F 15/403 



(g> Priority : 14.01.91 GB 9100733 

@ Date of publication of application : 
22.07.92 Bulletin 92/30 



@ Designated Contracting States : 
DE FR GB 



@ Applicant : XEROX CORPORATION 
Xerox Square 

Rochester New York 14644 (US) 



Inventor: Lamnting, Michael G. 
179 Hnis Road 
Cambridge CB2 2RN (GB) 
Inventor : Newman, William Maxwell 
Yew Tree Cottage, Famham 
Blandford Forum, Dorset (GB) 



g) Representative : Goode, Ian Roy et al 

Rank Xerox Patent Department Albion House, 
55 New Oxford Street 
London WC1A 1BS (GB) 



^ Indexing of data sets. 



@ A system is disclosed for facilitating later 
access to a portion of a datastream for retrieval 
purposes based upon identifying "episodes" in 
data from other sources related to a user's 
activities. The stored data, in raw or "episodic ' 
form, are accessed as desired using parameters 
based on recollections of the user's or database 
interrogator's activities at the time of the event 
which the user or interrogator wishes to access 
or display. Such other sources may indi/de 
information about the movements, gatherings, 
conveisations, phone calls, etc. of users in a 
particular environment fitted with the system. 
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This invention relates to the activity-based index- 
ing of data sets, in order to facilitate the fast retrieval 
at a later date of data of interest. The invention is 
based on there being at least one set of data defining 
activity. The netrieval process Involves identifying an 
index point in a data set. Where there are multiple 
data sets, having index points in one of them facili- 
tates access to the other set to reach the data in it at 
the point in time, or period, contemporaneous with the 
identified index point 

The invention focuses upon improving access to 
information that humans find particularly difficult to 
retrieve. It exploits a suggested autobiographical 
model of human memory which observes that humans 
seem to organise their memory in episodes. 
Associated with each episode are the properties; 
Where? When? With whom? and. In pursuit of what 
goal? an episode occurred. Humans seem able to 
recall these key properties of an episode long after the. 
details of the episode itself have been forgotten. The 
present invention is concerned with buflding a system 
that enables retrieval of episodes that would other- 
wise be difficult or impossible to retrieve. Including 
matters for which records are not normally kept This, 
system has been called a 'Human memory pros- 
thesis*. 

The present invention relies on a variety of ; 
apparatuses for nrtonitoring, recording and time- 
stamping various key aspects of human activity, from ^ - 
which a readable list of episodes, or 'activity diar/; 
can be generated automatfcally. This diary, which cor- 
relates quite closely with human recollections of past . 
episodes, can be used simply to stimulate further 
recall, or to identify an episode and its time of occun-- 
ence. or to show a set of episodes firom drfferent sets. 
The times may be used to locate the . corresponding 
items in any other set of time-stamped data, such as , 
a sequence of video frames or an audio record of a . 
meeting. 

In what follows, an 'episode' Is defined as - a ; 
period of time containing a memorable sequence of 
events By dividing a long series of events into such 
episodes, sequences are fonmed that are particularfy 
memorable. Episodes have flags associated with 
them that characterise them and can be used for ret- 
rieval. 

One form of such apparatus was disclosed in an 
article in The Sunday Times . 11 February 19go, enti- 
tied "Computers acquire social skills", by Jane Bird. 
This mentioned the use of identity buttons to enable 
the computer to know who was where in the building 
at all times. Associated with the system is a network 
of microphones and/or video cameras capable of 
recording conversations. It gave the example of a 
copier in a room monitored by the computer. Upon the 
copier's developing a fault, a frustrated would-be user 
tries to recall what a colleague told him about how to 
cure the fault The computer is interrogated to find the 



last time that the two people concerned were at the 
copier. Once this particular bit of information had been 
recovered, it was used to access the Ume-stamped 
audio records of what was said in the room containing 
5 the copier at that point of time. In this way, the enquirer 
had the opportunity of hearing again the conversation 
between him and his colleague. A system of the 
foregoing type also is described in a copending U.S. 
Patent Application of the instant inventors, which was 
10 filed 7 February 1 991 under Serial No. 07/652, 159 on 
"Indexing of AudioA/isual Data". 

The aforementioned system has several practical 
limitations. These indude the need on the part of the 
user to devote a lot of effort to identifying a particular 
15 time and place from the raw identity,button data, and 
the lack of precise episode definition from such data. 
Also the system captures just the location of users, 
and not their activity or other significant . episodes. 
These drawbacks would make the system of relatively 
20 little practical value. , » 

Another relevant publication (n this area of 
technology is tiie article entitled "Reflexive CSCW; 
supporting long-tenm personal work" by H Thimbleby,' 
. S Anderson and I Witten, Interacting with Computers . 
25 volume 2, No. 3, December 1990, pages 330-336. 
The tenrn "CSCW" means - computer-supported 
cooperative worie The section under the heading 

"Background ", on pages 331/2 of the article, refers to 
a method whereby a research worker is able to be 
30 given on request a record of workstation activities he 
was undertaking over a chosenperiod. The article 
suggests that this approach can generate useful infor- 
mation for retrieval purposes, but offers no practical 
means of doing so. Uke the first mentioned system , 
35 it fails to recognise the need for analysis of the raw 
data to produce user-recognisable episodes. 

The present invention is based on devising pat- 
^ tem-matching techniques for constructing recognis- 
able episodes from raw data. In this way it offers the 
4o user the means to anive quickly at a particular point 
in one dataset possibly in order to be able to obtain 
access more efficiehtiy and quicWy into at least one 
other dataset. This speed of access ts one of tiie 
essential requirements in making tiie technique of 
45 practical value. r . 

The invention also takes into account situations 
where there may be relatively infirequent. change of 
location by users, i.e. where location data would be of 
littie use in identifying episodes with precision. In 
50 these situations, other forms of activity are likely to 
provide a more effective means of episode identifi- 
cation. The invention therefore presents means of 
instrumenting tiiese activities and analysing the raw 
data, thus consbucting recognisable episodes from 
55 these multiple sources of activity data. The invention 
also observes ttiat retiieval often involves correlating 
activities of several different kinds, and therefore pro- 
vides means for doing so. 
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The present invention wBl now be described by 
way of example with reference to the acmmpanying 
drawings, in which: 

Fig. 1 is a diagrammatic view of a baste data-ret- 
rieval system of the present invention, in which 5 
the raw actrvtly-based data are broken down into 
episodes which may be selectively fed to various 
output devices; 

Fig. 2 is a view of a more-complex system than 
Fig. 1 . using two or more of the basic systems of io 
Fig 1 feeding into a compound episode recog- 
niser so as to be able to establish relationships 
between different forms of Input data; 
Fig. 3 is a diagrammatic view of one system of the 
present invention using 'active badges" to feed is 
movement information into the system; 
Fig. 4 is a diagrammatic view of a sut>-system for 
using badge episode data produced by the sys- r ' 
tern of Fig. 3. to identify when two or more people 
form a gathenng. and for forming a travelogue of 20 
the movements of people carrying badges be- 
tween different zones in the environment t>eing 
monitored; ' 

Fig. 5 shows in (a) a diary of indivlduaJ events in- 
the life of one person, and in (b) these events stm^ 25 
plified by amalgamating short periods alone into 
attendance at a gathering; 

Fig. 6 is a diagramnnatic view of part of a system - . 
of the present invention in which the activity being 
monitored is the presence of various documents 30\ 
on the user's vsKsrk space, with at least descriptors . 
of those documents being stored to complement ■ r 
the other data assisting in later retrieval; 
Figs. 7(a) to (c) are pa rtiaJ displays of a typical 
output of the Fig. 6 system; . . ' 35 
Fig. 8 is a diagrammatic view of a sub-system 
based on that system shown in Fig. 6, to break • 
down the presence of documents on the work- 
. space, and the timing of the appearance on, and . 
disappearance from, the workspace, to produce ' 40 
document-movement episodes; 
Fig. 9 is a diagrammatic view of a sut>-system of 
the present invention in which- the sounds 
received and emitted by a person wearing a 
microphone are subjected to a filtering action to . 45 
determine when the person carrying- the micro- 
phone is talking, from when he is not By the use 
of a compound episode recogniser as shown in 
Fig. 2. conversations between two or more people - . 
can be recognised as such and can be used to so 
form episodes for retrieval purposes; 
Figs. 10(a) and (b) are views of typical outputs 
from the Fig. 9 system, showing how convers- 
ations are distinguished from concurrent talk; 
Rg. 1 1 is a diagrammatic view of a sub-system 55 
rather similar to that of Fig. 8, in which the input 
Is of telephone calls made or received by an indi- 
vidual in the environment being monitored. By 
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means of signals derived from incoming calls, or 
from utterances of the person being nrranitored. 
such phone calls can be formed into episodes, 
with any resultant diary including the names of the 
people with whom the person conversed over the 
telephone, irrespective of whether or not that per- 
son identified himself by name, or was identified 
by name by the person being monitored; and 
F^. 1 2 is a diagrammatic view of a sub-systom in : 
which the activity being monitored is that of the 
use of the electronic workstation of the person 
being monitored. 

In the drawrings. many parts are spectftcally label- 
led as well as being given reference numtiers. When 
components or units are comnrKm to two or more of the 
Figures, they retain their original references. 

The data-rebieval system of this invention -is 
intended to operate on activity-based data from a 
choice of sources. " 

As shown in Fig. 1, the data may be fed directly 
to a menrory 4, prior to processing by the episode rec- 
ogniser 6; or the recogniser might operate on raw data 
and store the processed data In a mentory 8. Because 
the information to be retrieved is not known at the time 
of data capture, it is necessary to be able to specify 
the appropriate characteristk:s of the data to be ret- 
rieved by means of a query station .10. having a 
keyboard'12 which the user can use to alter the par- 
ameters of the recogniser 6 so that the raw data are 
processed v- into episodes having predetenmined 
characteristics. 

The output of the recogniser 6 is fed into a diary 
composer 14 that is adapted to process the episode 
data fed to.it and produce output data In the form of . 
diary information which;ts fed to a driver 16. Although 
the driver is shown as operating a printer 18. a visual 
display unit 20 and a loudspeaker or other audk> 
device 22, only one or none of these devices may t>e 
used in any specific embodiment, tHJt there should 
always be at least one output device able to present 
the data or information to the user,:and to enable him 
to identify a portion of the data which he associates 
with a particular episode he is trying to recall, so that 
the time-stamp of that episode then can be used to 
access video or audio recordings of the environment 
in which he was interested at the time of the episode 
in question.- 

Although no specific time-keeping devk;e is 
shown in the Fig. 1 embodiment it is to be understood 
that an forms of apparatus according to this invention 
have to be able to operate on only time-stamped data. 
As most microprocessors which form components of 
the system of the present invention have built-in time 
clocks, whereby the microprocessor output Is time- 
stamped, even if the user is not always able to see the 
time-stamp on first viewing,- but perhaps only on 
demand, the drawings have omitted the dock input 

That apparatus shown In Fig. 2 uses two dHTerent 
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sources 2 of raw of input data with two appropriately 
configured episode recognisers 6. Each episode rec- 
ogniser feeds its output to a compound episode rec- 
ogniser 24, of which the episode-recognition 
parameters are set by a query device 10, with the 
queries being passed as necessary upstream to each 
episode recogniser 6 feeding its output to the com- 
pound recogniser 24. This invo»vement of the indivi- 
dual recognisers with the compound recogniser is. 
necessary in order to enable the latter to sort data into 
compound episodes involving data of two different 
types combined in a way specified by the query sta- 
tion 10. In a nnanner similar to the sub-system shown 
in Figure 1 , the compound recogniser 24 feeds its out- 
put to diary composer 14 and an output device driver 
16 connected to selected output devices (not shown) 

As will be appreciated, the diary composer 3 is . 
application-dependent, whereas the episode recog- - 
niser2 is data-dependent The compound recogniser 
accepts both simple queries, as in the Fig. 1 system, 
as well as compound queries specifying relationships 
between episodes from different sources. 

The system shown in Fig. 3 is based on the use 
of encoded identifiers, each intended to be carried by 
people woricing in the environment being monitored. 
One particulaiiy-convenient fonm of identifier is the 
■active badge" offered by Olivetti . This identrfiertakes . 
the fonm of miniaturised drcuitry in a housing able to 
be pinned or clipped to the clothing of the person to 
whom the identifier is issued. The circuitry is designed 
to emit pulse width modulated infra-red coded signals 
for a tenth of a second every 12 to .lS.seconds. The 
signals, by their nature, have a range of aboutsix met- 
res, and will not travel through walls, which makes . 
them very useful for keeping track of people moving 
between rooms In a normal working environment In 
each of the rooms or corridors of interest (which would 
normally exclude toilets, lifts and like utility spaces) 
would be positioned one or more sensors responsive 
to the infra-red output signals. The sensors would be 
connected to a master station processor, which would 
have the task of polling the sensors for identifi- 
cation 'sightings'; processing the data (which would 
include decoding the signals to detemitne which per- 
son was within range of a specific sensor at the time 
of polling), and presenUng the processed data in a 
visual fonmat. 

The badge system shown in Fig. 3 uses a network 
of sensors 26 of which there is at least one in each 
zone of the building or other environnrtent being moni- 
tored, in large rooms, the effective range of the 
badges might necessitate there being several sen- 
sors unifonnly distributed over the room, as by being 
mounted from the ceiling, to ensure that each occup- 
ant who is carrying a badge is in range of at least one 
sensor, so that the room has no - blind spots'. The sen- 
sors 26 are connected to a polling device 28 which 
sends pulses to the sensors at chosen intervals, and 
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receives firom the sensors encoded signals indicating 
the badges within range of each sensor at the time of 
polling. The poller 28 supplies this badge infomtation 
to a badge server 30 having its own data memory 32. 
The badge server 30 supplies its signals to a badge 
database 36, having a data memory 38, so that the 
badge signals are decoded and the identity of the per- 
son allocated a badge becomes part of the badge sig- 
nal. The badge server 30 accepts the poller data and 
maps sensor signals into people and locations using 
its own database 32, and maps the detection of a par- 
Ucular person in a particular location at a specific time 
Into anrival and departure events. The database 36 
stores badge event data if appropriate. A badge 
episode recogniser 40 operates ori the output from 
the database 36 to characterise the tiadge signals into 
three types of episodes: gatherings; periods alone, 
and travel between or away ifrom moiiitored locations. 
Similarty to what has been already described, the rec^ 
ogniser 40 has its output signals treated by diary com- 
poser 14 and driver 16. . 

The sub-system of Fig. 4 details the way in which 
the badge event signals may be processed. The first 
stage of the badge episode recogniser 40 of Fig. 3 is 
a gatheririg recogniser 42. which receh^es badge 
event signals, and uses an event to keep a record of 
who is at each location, and notes the formation and 
dispersion of "gatherings". This tenm is a convenient 
one to use, as signifying an episode worthy of being 
recorded. A working definition of a "gathering" is that 
it is regarded as an - occasion which has started when 
two or more people are present concurrently at a 
specified location, for more than a specified minimum 
period, and at least one of the participants is not resh 
dent at that location This implies that the location 
database forming part of the environment identifies 
which is the office of each badge holder, so that his 
presence in his office is recognisably different from 
the presence of someone else in his office. The 
gatherings data are stored for use by a travel organ- 
iser 44, which receives badge events and supplies 
signals to a stopover list 46; the stopovers including 
locations where, more than a minimum period was 
spent locations at the tenminus of a round trip of more 
than a minimum distance, and the locations of gather- 
ings. 

Data from a floor-plan store 48 are used to recog- 
nise -round trips", by which tenn. is meant - when a 
person being monitored leaves his office and moves 
around the environment before retuming to it -. The 
stopover list 46 incorporates details of gatherings that 
are fed to the diary composer 14. The diary composer 
14 operates as illustrated in Fig- 5. The raw data 
showing a small part of the movements of one indivi- 
dual shown in Rg. 5 (a). This reiatfvely-raw data ywoukJ 
lead to an unnecessarily-long diary or event log. Thus 
the diary composer 14 operates on this information to 
combine sequences of consecutive episodes that 
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match specific patterns. Thus, for the activities of the 
person shown in Rg. 5 (a), the periods when he is 
alone in the conference room are treated as being part 
of the general gathering in that room, thus leading to 
the simplification shown in Fig. 5 (b). 5 

Fig. 6 shows the system which exploits scanning 
and image-processing technology to read the con- 
tents of documents on the desk surface; the user is 
then able to interact with these contents using a 
pointer such as a tablet and stylus. A wide range of io 
interaction techniques may be used, including menu 
selection and dialogue boxes', graphical selection and 
hand printed character recognition. Moreover.the 
user is also free simply to Wite ondocuments with a 
pen or with an ink pen in the stylus. Particular tasks is 
for which the system could be effective indude: 
selecting key words for document filing; performing 
calculations on numeric data; translating foreign 
words, looking up technical terrns in dictk>naries, 
checking and filling in forms; retrieving other docu- 20 
ments or information that cannot be printed, e.g. audio 
or vkleo. The invention oould therefore give desk 
workers and people attending meetings access 'to 
functions that are cunrentJy available only via work sta^ 
Uons. The apparatus shown in Fig. 6 includes a video 25 
camera 50 and a video projector 52 mounted above 
a desk 54 at which the user sits, a data tablet 56 with 
a stylus 58, a high-resolution scanner 60, and a con- 
trol computer 62 with its data store or memory 64 for *■ 
images. ' 30 

The camera 50 is mounted so that it field of view- ' 
Includes the document on the desk 54 in front of the - 
user. The video signal b digitised using a frame store, 
creating a bit map image of the document . Digital fB-- 
tering techniques are applied 'to" the image, first to 35' 
determine size and rotation, and then to classify the 
document according to properties such as line spac- 
ing, line-length distrilMJtion,' etc. Application of these 
techniques provides (a) a compact resdlutiorvinde-^ 
pendent encoding of the images properties for use as 40 
a descriptor, and (b) oriehtatton and position infor- * 
mation. The data tablet 56 and stylus 58 permit the 
user to select items such as words' or coturhns of num- 
bers. When the user makes a selectroh as by pressing 
down on the tablet surface, the coordinates of the 45 ' 
stylus 58 are transformed, using the inverse of the 
rotation and position information, to determine the 
selection in the coordinates of the page. 

If the image scanned in from the camera 50 is of 
insufficient resolution to support interaction, the docu- so 
ment can be scanned in at high resolutton. This oper- 
ation is done via the sanner 60. and it could be 
perfomned on the entire contents of the user's in-tray 
before a work session, or on an entire set of meeting 
papers before a meeting starts. The some digital fil- 55 
tering techniques as before are applied to the high- 
resolution page image to construct a descriptor and to 
correct for rotation. Optical character recognition 



(OCR) is aprriied to extract the text of the document 
This text is stored for later use, indexed by the docu- 
ment descriptor. The page coordinates of each text 
item are included with the stored text 

The memory 64 contains layout information for 
each document page entered into the system, 
indexed by descriptor J The user need not apply high- 
resolutktn scanning or OCR to a document page that 
is already stored. As soon as the descriptor has been 
generated from the vkteo scanned data, the existence 
of a stored version can be determined. Layout infor- 
mation describing computer-generated documents 
can be written directly to the memory 64, and descrip- 
tors generated, without recourse to scanning or OCR. 

The descriptor of each page of a mutti-page docu- 
ment, or of each single document, is non-unique, but 
is designed to ensure that descriptors are rarely dup- 
licated. Descriptors for pairs of document images may 
be compared using a conrelator to determine if the two 
images aris of the same document When the user 
selects ah item with the stylus, the corrected coordi- 
nates are used to identify the text item in the stored 
version: The appropriate function (translation, index- 
ing, etc.) can then l>e applied to the item. The system 
can also supply hand-printing recognition to the stylus 
input, or record changes to the scanned image result- 
ing from conventional handwriting. 

Feedback to the usier is provided via the video 
projector 52. Examples of such feedback are shown 
in Fig. 7i Results of translations (Fig. 7a) or calcu- 
lations (Fig. 7b) would normally be shown alongside 
the text} If space permits (and the system can deter- 
mine this) the results could be displayed directly 
at>ove the selected iterii. The' vkleo projector 52 also 
displays menus and diatogue* boxes (see Fig. 7c) for 
interactive control. • ' ^ 

Each time the user places a new document on the 
tablet or turns the pages of a multi-page document, 
the system can detect the change in the video-scarv 
ned image and theri recompute the invage's descrip- 
tor. It searches for a matehing descriptor in its file of 
documents. Thus if it finds a match, it retrieves the 
documerit. If, on the other hand, no match is found, it 
signals to the user to feed the document into the scan- 
ner 60. 

This emtjodiment of the invention can be com- 
pared either with workstation-tiased interaction or 
with traditional paper-based work practice. As an 
alternative to workstation-based interaction, it has the 
advantage that it can deal with any incoming paper 
document, or print-cut of an electronic document 
Compared with paper-based methods, it has the 
advantage of providing access to a number of com- 
puter-based functk}ns and servk:es. It has particular 
advantages in meeting rooms, where use of paper 
documents is a firmly-trenched practice. 

The 'audit trafl" left by the system provides a 
source of data about the user's interaction with paper 
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documents. These data can be used to constml 

either by selected key words or by optically^ cor^ 

from the computer 62 and am«ry 64 of the Fia 6 , 
embodiment The computer feeis raw dL™„! 
events to a document episode recogniser wTe 
■ !!f"T^''''*"^''^"^'''°'^-"''™~"<feandsel^e^ 
fj» . . ^«<»9™=e^ bunds episodes from consecu- 
ftve events applied to the same document: episode «. 
descnptjons include time-stamp. descriptor and ^ 
Of ep^ode , including reading, editing. ™„g. calculi , . 
;ng). The dery. composer 14 lakes episodes describ- 
ing individual document pages and buBds .episodes 
."^dotllr — «^ — and.on. , „ 

.oder:;r^ra;r.rj^~ 

not tell Whether mese peopte torn, a ^u^ 

vS^nVtr'"" "-'"9 -^-"ility todetL.^,. " ,0 
versatons between peopte, without necessarSy 
recording their voices, so as to provide a miSf^e 
accurate indicator of group activity. Usen^ ty^^" 
r^^ember conven^ations betler than , hey ren^,^^ 
otherpar^cipantspresencesptheabaitytodetectth^ . 

re« 7!T°" '"'"'y "e useful tor 

retneval. In the Fig. 9 system, a microphone 68 is c^- 

duced by the microphone is monitored by means of a 
circuit of the type used in cassette recordem: The2 'so 
cu. controller 70 transmits a signal whenever "e . 
energy level of the audio signal exceeds a.threshold 
^PJ^enhng the wearer's . minimum voice ^ lelt. ^ 
Th«e signals, and their Ume of receipt, are collected 

All of the energy data sets from a particular loca- 
bon are analysed together. The analysis s«tet " 
Identity periods of high energy as utternces.tTd L 

fonr, continuous conversations. The methods draws 
on social science research results showing that^^ 

ly-short pauses between turrs. Thus the composite 
audio stream is more or less continuous. wT^dy « ' 

onh^nespeakeractive.Fi9.io(a)shows.heaS 
generated by participants in a typical conv«t 
abon. While Fig. ,o (t) shows, in conbS? Z 

o^l? P^"*""* Of sflence. non- 

oveHapping speech, and overtapping speech In true 
com,e,«a.K>n between h»o people, ovldap d ,^d 
to represent less than 10 percent of all non-s len~ 

hon w II be considerably higher, at least 25 pen:enr 
Although this has not been shown in Z dt^l 
■ngs. an alternative method of caphiring u,e cotnrei^ 



10 



ation data is by radio telephony. Audio skinals 
users-^micmphones ate co^ecti^ a. Z 
(.on. heir energy levels are measured, and the 
versaborvdetection analysis earned .^it This ?a 

bandwidth but It mayencounter Channel limitaBonT 
The diary composer 14 reduces the llst^ 
episodes by excluding short episodes, and by 1^ 
com^rsations between Wendcal groups ^ 
Fig. 11 Shows a sub-system that builds a loo of 
mcoming and outgoing telephone calls that in c^X 
*e names Of respondents. The proposed Se^ 

Cbleo^tnr,'"'"'"^^**'''''''"-^''--^^^ 
^ble.of building a text database, each record ind^ , 

1^ and * ""^ and.durationof a call, and both the^' 
ler^s and the recipienrs riames, with one of the names 

bernggeneratedbyaspeech-recognitionsysZ^ 
sys^shown ^ Fig, li uses in^t trom 
76 from «h,ch signals are picked Off and fed 
speech recogniser 78 that has its output iout«| to a 
elephone call episode recogniser 80. 'with^^' 
"Hum. going to the no,««ual composer 14. Ou^ 

setofentnesisshownat84. 

n.r.?'^ the invention relies on recognising 

calls and in makmg outing calls 

On making ou^oing calls the speech recogniser 

»«"a' contiZS" 

Johnson please- or -Could I speak to Patrick" eTZ 
long as the peison being called has cor^p ^^i^ 
voice «lentification data entered in a data^^eS 
userance by the caller of the spoken name 
the audio.signals Which are fed to rocognfeer 80 ^ 
«c»g™ser 80 generates the recipie^s name a^ 
text sting. On receding incoming calls, the speech 

a^rr t" '^" '*^'*""^' '» ^ reclpiem-r,:^ 
ance of the foro, THello PatHck- or -Hello Mr Johnson- 
andgenerates the caller's name as a text string T^fe 
recognisergenerated data is stored, so it may be 
for subsequent manual confim»,ion by the ^ 

calls to ft^P' ""^"^ 

cans to the same person, and ^nore wrong numbeis 
and interceptfon by switchboards, eta If the 
not known to the user, or if the user calls sc^^ 
whose data.is not in database 82 the svstem^n!^ 
supply the idenfificatton data. Thero is ^r^e 
siblity mat. in the case of an incoming ciT the ^ 

Which the recogniser 80 is programmed. ThfeTaT 

mL^^ ^1 '^">SniVon is low, can be used to 
markthe caller as unknown in the log entn, 

Thesub-system shown in Fig. 12usestheactivitv 
ofa tier's worfcstabon to generate episode dat^A 
workstauon activity episode recogniser 86 has a d^b 



NSDOCID: <EP__CMSS6a2A2_L> 



11 



EP 0 495 622 A2 



12 



memory 88. The recogntser has various fonns of input 
from a workstation, as shown in the drawing. One 
input might relate to fae activity, another to processor 
focus, and a third to prbgrain* input focus. These 
inputs , are from daemons or watchdog processes. 5 
These monitor activity such as: the o-eation. reading, 
modifying or deleting of files; the name of a program 
consuming processor resource in a given period, or 
changes in program input focus (in a window-based 
user interface). The recogniser 86 translates sequ- io 
ences of activity into user-recognisable episodes. 
These include; editing a document; reading, answer- 
ing and faing mail; developing a program; and retriev- 
ing information from a database. The recogniser 86 
applies pattern-matching to recognise common is 
tasks , i.e. "change input 'focus to editor ^ change 
processor focus to editor "-> read ffle^ write file 
write file -> change input focus", would be recognised 
as being the steps involved in editing a document The 
composer 1 4 corribines siimilar episodes and removeis 20 
insignificant episodes. 

Users can subrhit queries to 'each of the recognis- - 
ers described above by means of its respective query 
input 1 0. A query will specify an episode or episodes^ 
of interest, including a pattern of ejpisodes,' using key 25 
words and descriptions appropriate to the recogniser, 
for example: 

- with KBW in Commons last week;' ' 

- document read after "reading Mid-year Report; 

- talking to KBW and MGL; 30 

- calls to MGL yesterday; a rKJ • ' 

- reading mail yesterday.' 

In the case of the compound recogniser 24 (Fig 
2) such a query might involve a pattern of the form: 

- sending mail to KBW after meeting with MGL. 3S 
Although the different embodinr^ents of the inven- 
tion are implemented in the form of computer code, 
which is not included in this specification, it is never- •' 
theless believed that the above description and draw- 
ings are sufficient to enable a man skilled in the art of ' 40 
designing software systems to put this' invention into 
practice. 



Claims 45 

1 - A system for providing for fast random access to 
a sequence of time-stamped data records of 
events in a specific environrnent, compr^ing: 

means for collecting data in order to con- so 
struct time-stamped records indicating the !oca- • 
tions of persons in the environment; 

means for encoding and storing the 
records for later selective retrieval and decoding; 

means for specifying a pattern characteri- 55 
sing an episode as defined, and for receiving and 
decoding matching records; 

means for applying rules to the records to 



anrrve at records of episodes matching a specified 
search pattern, in which each episode record is 
made up of periods spent alone by the person 
concemed; in travel within the environment, and 
in attendance at gatherings, and 

means for requesting transformation into 
human-readable form of ttie episode records cor- 
responding the specified pattern. 

2. A system as claimed in claim 1 , in which the envi- 
ronment is divided into several zones, and includ- 
ing: ' ' 

' mearis for monitoring one or more of the 
zones to detect the presence within the zone of, 
= and to identify, a person. 

3. A system as claimed in dairri 1 or 2. in which at 
least some of the people in the environment cany 
a personal tdentrfier from which data can be col- 
lected by one or more monitors of the envirorv 
ment 

4. A systenri as -claimed in dalm 3, in which each 
identifier is in the form of a transmitter of identift- 
catibn signals to virhich each monitor is sensitive. 

5. A system for providing for fast random access to 
a sequence of time-stamped data records of 
evente in a specific'environmenl, comprising: 

"' means for collecting data in order to cor>- 
struct time-stamped records identifying scann- 
able document pages and indicating operations 
on the pages at locations in the environment; 

means for encoding and storing the 
- records for later selective retrieval and decoding; 
means for enabling identification of the 
document contents from a record. 

6. A system as claimed in claim 5. including means 
for projecting a display on to the surface of the 
document or on to' a surface txirdering the docu- 
ment- ' 

7. A system as dainried in daim 5 or 6. induding 
means for manually selecting from within this 
document or for displaying a computer function to 
be applied to the document 

8. A system as daimed in any of daims 5 to 7, 
induding means for manually selecting one or 
more ttenr\s on the identified document to be used 
as input to a computer function to t>e applied to 
the document 

9. A system for providing for fast random ac^ss to 
asequence of time-stamped data records of 
activities in a specific environment , comprising: 

means for collecting data in order to con- 
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Struct time-stamped records of periods of vocal 
sounds by people in the environment; 

means for operating on those records to 
detemnine when a conversation ts taking place 
involving one or more of the said people, and for 
generating time-stamped records of such deter- 
minations; 

means for encoding and storing the 
records for later selective retrieval and decoding 
and 

means forenabling identification of the col- 
lected conversation data from a record. 

1 0. A system for providing for fast random access to 
a sequence of time-stamped data records of 
activities in a specific environment, comprising: 

means forcollecUng audio signal and other 
data from a person in the environment placing or 
receiving a phone call in order to construct time- 
stamped records of calls; 

means for feeding audio signals from 
these records to a speech recogniser capable of 
identifying the other participant in the call from the 
words uttered by the person in the environment, 
and for adding this identification to the records; 

means for encoding and storing the 
records for later selective retrieval and decoding 
and 

means for enabling identification of the call 
data from a record. 

30 

11. A system for providing for fast random access to 
a sequence of time-stamped data records of 
actMUes in a specific environment, comprising: 

means for collecting data in order to cph- 35 
struct time-stamped records of the activity of each 
workstation associated with each person in the 
environment, such as the creation, reading, modi- 
fication or deletion of electronic fBes; 

means for encoding and storing the 40 
records for later selective retrieval and decoding; 

means for specifying a pattern characteri- 
sing an episode, and for receiving and decoding 
matching records, and 

means for enabling identification of the cd- 45 
lected data from a record. 



12. A system as claimed in daims 4 and 9, in which 
each transmitter has associated with it a micro- 
phone for monitoring the level of the audio signals 
produced by the person carrying the transmitter, 
the microphone circuit having a filter for rejecting 
all signals of less than a threshold level, and 
transmitting data describing periods of vocal 
sounds greater than the threshold level. 

13. A system as claimed in any preceding daim, com- 
pnsing means for producing at least auditory 
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records of what was said In each of the zones. 

14. A system as daimed in any preceding daim cap- 
able of generating episodes of two or more types 
induding means for airiving at episode records 
satisfying a specified relationship between 
episodes of two or more types. 

15. A system as claimed in any preceding claim 
including: ' 

means fo^ specifying a pattern characteri- 
sing an episode as defined, and for receiving and 
decoding notching records; 

means for applying rules to the records to 
anrrve at records of episodes matching a specified 
search pattern, and 

means for requesting transformation into 
human-readable fonn of the episode records cor- 
responding to the specified pattern. 
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@ A system is disclosed for facilitating later 
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